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Abstract. We consider the following problem: given that a finite au- 
C**) ' tomaton M of N states accepts at least one fe-power-free (resp., overlap- 

free) word, what is the length of the shortest such word accepted? We 
give upper and lower bounds which, unfortunately, are widely separated. 

Ph. 1 Introduction 



< 



Let L be an interesting language, such as the language of primitive words, or the 
language of non-palindromes. We are interested in the following kind of question: 
given that an automaton M of N states accepts a member of L, what is a good 
bound on the length £(N) of the shortest word accepted? 

For example, Ito et al. |7: proved that if L is the language of primitive words, 
then t(N) < 3N — 3. Horvath et al. [6] proved that if L is the language of 
non-palindromes, then £(N) < 3N. For additional results along these lines, see 

m- 

For an integer k > 2, a k-power is a nonempty word of the form x k . A word is 
k-power-free if it has no fc-powers as factors. A word of the form axaxa, where a 

qs , is a single letter, and x is a (possibly empty) word, is called an overlap. A word 

l/"~) ■ is overlap-free if it has no factor that is an overlap. 

0^ , In this paper we address two open questions left unanswered in pQ, corre- 

sponding to the case where L is the language of /c-power-free (resp., overlap-free) 

■"sj" \ words. For these words and a large enough alphabet we give a class of DFAs of N 

^D ■ states for which the shortest fc-power (resp., overlap) is of length A^4( lo s Ar )+°( 1 ). 

For overlaps over a binary alphabet we give an upper bound of 2 of - N ' . 
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2 Notation 

For a finite alphabet £, let £* denote the set of finite words over S. Let w = 
CoOi • • • On— i G S* be a word. Let w[i] — Oj, and let w[i..j] = a,i---a,j. By 
convention we have w[i] — e for i < or i > n, and w[i..j] — e for i > j. A prefix 
p of w is a period of w if w[i + r] = w[i] for < i < \w\ — r, where r = \p\. 

For words x,y, let x < y denote that x is a factor of y. A factor x of y is 
proper if x =£ y. Let x -< p y (resp., x < s y) denote that a; is a prefix (resp., suffix) 
of y. Let x -< p y (resp., x ^ s y) denote that x is a prefix (resp., suffix) of y and 
x^y. 
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A word is primitive if it is not a fc-power for any k > 2. Two words x, y are 
conjugate if one is a cyclic shift of the other; that is, if there exist words u, v 
such that x = uv and y = vu. One simple observation is that all conjugates of a 
fc-power are A;-powers. 

Let h : S* — > S* be a morphism, and suppose h(a) — ax for some letter a. 
The fixed point of h, starting with a € S, is denoted by h u (a) = a x h(x) h 2 (x) ■■■. 
We say that a morphism h is /c-power-free (resp., overlap- free) if h(w) is fc-power- 
free (resp., overlap-free) if w is. 

Let S m = {0, 1, . . . , m — 1}. Define the morphism /j, : £% — > T,^ as follows 

/i(0) = 01 
M (l) = 10. 

We call t = /i"(0) the Thue-Morse word. It is easy to see that 

H(t[0..n - 1]) = t[0..2n - 1] for n > 0. 

From classical results of Thue [10111] , we know that the morphism /i is overlap- 
free. From [2], we know that that /i(x) is /c-power free for each k > 2. 

For a DFAD = (Q, £, <5, qo,F) the set of states, input alphabet, transition 
function, set of final states, and initial state are denoted by Q,£,5,F, and qo, 
respectively. Let L(D) denote the language accepted by D. As usual, we have 
5(q,wa) = 5(5(q,w),a) for a word w. 

We state the following basic result without proof. 

Proposition 1. Let D = (Q,U,8,qo,F) be a (deterministic or nondeterminis- 
tic) finite automaton. If L(D) ^ 0, then D accepts at least one word of length 
smaller than \Q\. 

3 Lower bound 

In this section, we construct an infinite family of DFAs such that the shortest 
fc-power-free word accepted is rather long, as a function of the number of states. 
Up to now only a linear bound was known. 
For a word w of length n and i > 1, let 

cyc^w) = w[i..n — 1] w[0..i — 2] 

denote w's ith cyclic shift to the left, followed by removing the last symbol. Also 
define 

cyc (w) = w[0..n — 2]. 



For example, we have 



cyc 2 (recompute) = computer, 
cyc 4 (richly) = lyric. 
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We call each cyc^w;) a partial conjugate of w, which is not a reflexive, sym- 
metric, or transitive relation. 

A word w is a simple k-power if it is a fc-power and it contains no fc-power 
as a proper factor. 

We start with a few lemmas. 

Lemma 2. Let w = p k be a simple k-power. Then the word p has \p\ distinct 
conjugates. 

Proof. By contradiction. If p k is a simple fc-power, then p is a primitive word. 
Suppose that p = uv = xy such that x -< p u and yx — vu. Without loss of 
generality, we can assume that io^e. Then there exists a word t ^ e such that 
u = xt and y = tv. From vu — yx we get 

vxt — tvx. 

Using the second theorem of Lyndon and Schiitzcnberger [5] , we get that there 
exists z^£ such that 

vx = z l 

t = z 3 

for some positive integers i, j. So yx — z t+1 , and hence p — xy is not primitive, 
a contradiction. □ 

Lemma 3. Let w be a simple k-power of length n. Then we have 

n 
cy^O) = cyCj-M iffi = j (mod -). (1) 

Proof. Let w — p k . If i = i' (mod f) and i' < f , then 

cyc» = (p[»'. 2 - l} p [Q..i' - l]) k - l cy Ci ,(p). 

Similarly, if j = j' (mod ?) and j' < v> then 

cyc» = (p[?'..| - l]p[0..j' - l]) fe - 1 cyc i ,(p). 

If z' = j', then clearly cyc^w) = cycAw). If i 1 ^ j', we get that 

p[i '. _ _ i] p[ o.i' - 1] ^H?'.~ - i] P [o../ - 1] 

using Lemma [3 and hence cyc,(u>) ^ eye j(w). 

a 

Lemma 4. All conjugates of a simple k-power are simple k-powers. 
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Fig. 1: starting positions of the occurrences of q inside x 



Proof. By contradiction. Let w = p k be a simple fc-power, and let z ^ w be a 
conjugate of «?. Clearly z is a fc-power. Suppose z contains q k and z 7^ q k . Thus 
|<7 < |p|. Since w is simple q k ^ w — p k . The word a; = p k+1 contains z as a 
factor. So x = uq k v, for some words u,v ^< p. 

Note that u and w are nonempty and not equal to p since q k -f± p . Letting 
e := \p\ — \q\, and considering the starting positions of the occurrences of q in x 
(see Fig. rjj, we can write 

x[|p J w| — ie..\p l u\ — (i — l)e — 1] = x[|p-'w| — jc.^m — (j — l)e — 1] 

for every < i, j < k. Since p is a period of x, we can write 

x[\u\ — ie..|u| — (i — l)e — 1] = x[|u| — je..|u| — (j — l)e — 1] 

which means x[u — (k — \)e..u + e — 1] -< w is a /c-power. Therefore w contains 
a /c-power other than itself, a contradiction. □ 

Corollary 5. Partial conjugates of simple k-powers are k- power- free. 

The next lemma shows that there are infinitely many simple fc-powers over 
a binary alphabet for k > 2. We also show that there are infinitely many simple 
squares over a ternary alphabet, using a result of Currie [1]. 

Lemma 6. 

(i) Let p = t[0..2 n — 1] where n > 0. For every k > 2, the word p k is a simple 

k-power. 
(ii) There are infinitely many simple squares over a ternary alphabet. 

Proof. 

(i) By induction on n. For n — we have p k = k which is a simple fc-power. 
Suppose n > 0. To get a contradiction, suppose that there exist words u, v, x 
with uv y^ e and x ^ e such that p k = ux k v. Note that \x\ < \p\, so 
\uv\ > k. Without loss of generality, we can assume that \v\ > |~-|] > 2. Let 



n-l 



q = t[0..2 



We can write 



1]. We know that 



p k = fi(q k ). 



=< P /^"Mo-kl - 2]). 



Since /i is fc-power-free, the word q k 1 <7[0..|g| — 2] contains a fc-power. Hence 
q k contains at least two fc-powers, a contradiction. 
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(ii) Currie [1] proved that over a ternary alphabet, for every n > 18, there is a 
word p of length n such that all its conjugates are squarefree. Such squarefree 
words are called circularly squarefree words. 

We claim that for every circularly squarefree word p, the word p 2 is a simple 
square. To get a contradiction, let q 2 be the smallest square in p 2 . So there 
exist words u,y with uy ^ e such that p 2 = uq 2 y. We have \q 2 \ > \p\ since 
p is circularly squarefree. Therefore, if we let p = uv = xy, then |x| > \u\ 
and \v\ > \y\. So there exists t such that x — ut and v =ty. We can assume 
|i| < \q\, since otherwise \t\ — \q\ and \uy\ = 0, a contradiction. Now since 
q 2 = f x = tyut, we get that q begins and ends with t, which means t 2 -< q 2 . 
Therefore p 2 has a smaller square than q 2 , a contradiction. 

□ 

Next we show how to construct arbitrarily long simple fc-powers from smaller 
ones. Fix k = 2 (resp., k > 3) and m — 3 (resp., m = 2). Let W\ 6 Z 1 ^ be a 
simple fc-power. Using the previous lemma, there are infinitely many choices for 
W\. Let w\ be of length n. Define lUj+i £ ^m+i f° r * — 1 recursively by 

w 4 +i = cyc (w; i )ai cyc„ s -i (wj)aj cyc 2 „i-i (to»)oi • • • cyc ( „_ 1) „ i -i (wi)a, (2) 

where a^ = m + i — 1 and wq = 0. The next lemma states that Wi, for i > 1, is a 
simple fc-power. Therefore, using Corollary[5j each word cyc (wi) is fc-power-free. 
For z > 1, it is easy to see that 

\w t \ = n\wi-i\ = ri . (3) 

Lemma 7. For every i > 1, t/ie word u>i is a simple k -power. 

Proof. By induction on i. The word «?i is a simple fc-power. Now suppose that 
Wi is a simple fc-power for some i > 1. Using Lemma [3J we have cyc,- n «_i (tUi) = 

cyc (j - +f)ni -i(u)i), since i^l = ^. 

We now claim that Wi+i is a fc-power and 

w i+i = {cycoiw^ai cyc n i-i (w l )a l cyc 2n i-i (w l )a l ■■■ cyc ( ^_ 1)nt -i (wi)ai) k . 

To see this, suppose that u>i+i contains a fc-power y k such that ioj+i 7^ y k . 

If y contains more than one occurrence of ejj, then y = uat eye Jwi)aiV for 
some words u,w and an integer j. Since y 2 = ua,i eye j(wi)a,ivuai eye j(wi)a.iV ^ 
lUi+i, using © and Lemma [31 we get 

III 1 \ I -^ n i \ W i+l\ 

\y = \cyc A Wi)a,ivuai\ > -n = — : — , 
1 J ' k k 

and hence y k = Wi+i, a contradiction. 

If y contains just one a%, then y = uaiv for some words u,v which contain 
no Oi. So y k — u(avu) k ~ 1 av for a — o^. Therefore vu is a partial conjugate of 
Wi. However the distance between two equal partial conjugates of Wi in iuj+i is 
longer than just one letter, using @ and Lemma [3] 

Finally, if y contains no a,, then a partial conjugate of Wi contains a fc-power, 
which is impossible due to Lemma [U □ 



VI Hamoon Mousavi and Jeffrey Shallit 

To make our formulas easier to read, we define ao = Wi[n — 1]. 

Theorem 8. For i > 1, there is a DFA Di with 2 % ~ x (n — 1) + 2 states such that 
cyc (wi) is the shortest k-power-free word in L(Di). 

Proof. Define D\ — (Q\, S ai , o~i,qi,o,Fi) where 

Qi '■= {qi.o, gi.ij Qi,z, ■ ■ ■ , qi,n-i, qd}, 

Fi := {<7i,„-i}, 

fii(li,j, w \j]) '■= 9ij+i for < j < n - 1, 

and the rest of the transitions go to the dead state qd- Clearly we have |<5i| = n+1 
and L(D 1 ) = {cyc (wi)}. 

We define Di — (Qi, £ ai ,5i,qifl,Fi) for i > 2 recursively. For the rest of 
the proof s and t denote (possibly empty) sequences of integers and j denotes 
a single integer (a sequence of length 1). We use integer sequences as subscripts 
of states in Qi. For example, qi t o, q s j, and q s ,2,t might denote states of Di. For 
i > 1, define 

Q l+1 ~ Q t U {q i+ i,t : qt £ (Qi ~ F t ) - {q d }}, (4) 

F i+ i := {qi+i,i,t ■ 8i(qi,t,c) = qi^-i for some c 6 S az }, (5) 

if q t e Qi and c e E ai , then 6 i+ i(q t ,c) := 5i(q t ,c) (6) 
if q t ,q s G (Qi - Fi) - {q d }, ce£„,, and 5i(q t ,c) = q s , 

then Si + i(q i+ i it ,c) ■— q l+ i. s (7) 

ii q t 6 F h then 5 i+1 (q t ,ai) := <7 M and 5 i+1 (q t ,a l ^ 1 ) := q l+ is fi (8) 
if i > l,qi+i,t £ F i+ i, and 8 l (q t ,a i ^ 1 ) = q ld , 

then 6i +1 (q i+ i it ,a i ) := qij+i (9) 

and finally for the special case of i = 1 , 

52(q2,i,j,ai) := qi,j+2 for < j < n - 2. (10) 

The rest of the transitions, not indicated in (f^T)— (fTUj) . go to the dead state qd- 
Fig. l2bl depicts D 2 and D 3 . Using ©, we have |Q i+1 | =2|Q,|-2 = 2 4 (n-l) + 2 
by a simple induction. 

An easy induction on i proves that \Fi\ = 1. So let fi be the appropriate 
integer sequence for which Fi = {qf t }. Using ((6l)- (fi"0|) . we get that for every 
1 < j < n, there exists exactly one state qt £ Qi for which 5i(qt,di-x) = qij. 

By induction on i, we prove that for i > 2 if Si(q t , Oj-i) = gij, then 

xi =cycy_ 1)n i- a (wi-i), (11) 

a; 2 = iUi[0..jn i-1 - 2], (12) 

x 3 = Wi[(j-l)rJ- 1 ..n i -2]. (13) 
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are the shortest fc-power-free words for which 

Si(qi,j-i,x{) = q t , (14) 

5i(qi,o,xn)=qt, (15) 

6i(qi,j-i,xa) = qf { . (16) 

In particular, from (Tl3l) and (|T6| . for j — 1, we get that cyc (u>i) is the 
shortest fc-power-free word in L(Di). 

The fact that our choices of Xi, x%, and X3 are fc-power-free follows from the 
fact that proper factors of simple fc-powers are fc-power-free. For i = 2 the proofs 
of (|14 p — ([15] ) are easy and left to the readers. 

Suppose that fli ]) -([l6 l) hold for some i > 2. Let us prove (fT3 ]> — ([16 ]) for i + 1. 
Suppose that 

Si+i(q t ,ai) = qi,j. (17) 

First we prove that the shortest fc-power-free word x for which 

6i+i(qi. 3 -i,x) =q t , 

is x = cyc (J _ 1)n i-i(w i ). 

If qt € Qi, from ([5]) and (|T7|. we have 

q t = qfi , and 
Si+i(qt,ai) = qij. 

By induction hypothesis, the cyc (uii) is the shortest fc-power-free word in L(Di). 
In other words, we have 8i(qi,o,cyc (wi)) — qf t — qt, which can be rewritten 
using © as Si + i(q lfi ,cyc (wi)) = q t . 

Now suppose qt ^ Qi- Then by © and (fT7|) . we get that there exists t' such 
that 

t = i + l,t'; 
6i(qt>,Oi-i) = qi,j-i- 

From the induction hypothesis, i.e., (|15p and (|16[) . we can write 

*i(«i,o,u><[0..(j - IK' 1 - 2]) = ft,, (18) 

Si(qi d -i,Wi[U ~ l)n i_1 ..n* - 2]) - «/«. (19) 

In addition w,[0..(j — l)n l_1 — 2] and Wi[(j — l)n* — 1 ..ra* — 2] are the shortest 
fc-power-free transitions from qi.o to q# and from <Zi.j-i to qf t respectively. Using 
(O, we can rewrite (fTB"]) and (fl"9)) for 8i+i as follows: 

*i4-i(gi,o,ti>i[0..(j - I)" 4 " 1 - 2]) = q t ', (20) 

«5 i+1 (g 1>j _ 1) ^[(i-lK- 1 ..^-2])= % . (21) 

Note that from © and ([20]). we get 

8 i+ i(qi + i t ifl,Wi[0..(j - l)n i_1 - 2]) = q i+l<t > = q t . (22) 



VIII Hamoon Mousavi and Jeffrey Shallit 

We also have Si + i(qf i ,a i ) = <7i+i,i,o, using (|SJ). So together with (|2"T1) and (I22"j) . 
we get 

5i + i(qi,j-i,cyc( j _ 1)n i- l (wi)) = q t 

and cyCfj__ 1 \ n i-i(wi) is the shortest fc-power-free transition from qi,j-i to qt- 
The proofs of (fT"5)) and ([TrJ| arc similar. D 

In what follows, all logarithms are to the base 2. 

Corollary 9. For infinitely many N, there exists a DFA with N states such 
that the shortest k-power-free word accepted is of length N~4 l ° sN+0 ( 1 > . 

Proof. Let i = [log raj in Theorem [5] Then D = Di has 

N = 2L 1 °s"J- 1 (ra - 1) + 2 = f2{n 2 ) 

states. In addition, the shortest fc-power-free word in L(D) is cyc \w\\ 0&n \j. 
Now, using (|2]) we can write 

|cyc ( WLlognJ )|=raL lo s"J-l. 

Suppose 2* < ra < 2 t+1 - 1, so that t = |k>graj and Then log N = 2t+ 0(1), so 
i(logiV) 2 =t 2 + 0{t). On the other hand log | w\ = |Jog raj (log ra) = t(t + 0(l)) = 
t 2 + 0(t). Now 2°W = n°^ = N°^\ and the result follows. □ 

Remark 10. The same bound holds for overlap-free words. To do so, we define 
a simple overlap as a word of the form axaxa where axax is a simple square. 
In our construction of the DFAs, we use complete conjugates of (ax) 2 instead of 
partial conjugates. 

Remark 11. The Di in Theorem[S]are defined over the growing alphabet ZVn+i-i • 
However, we can fix the alphabet to be E m +\. For this purpose, we introduce 
w[ which is quite similar to Wi'. 

w[ = Wi, 

w' i+1 = cyc (m-)6 l cyc ni _i (wfibi cyc 2 „ s -i (w'^h ■ ■ ■ cyc (n _ 1)nS -i (w'^bi, 

where hi = mcim such that a is (any of) the shortest nonempty fc-power-free 
word over E m not equal to Ci, . . . , c%-\. Clearly we have \bi\ < |&i-i| + 1 = 0(i), 
and hence w[ = (ra 4 ). 

One can then prove Lemma[7]and Theorem|8]for w[ with minor modifications 
of the argument above. In particular, we construct DFADJ that accepts cyc (u4) 
as the shortest fc-power-free word accepted, and a D\ that is quite similar to Di. 
In particular, they have asymptotically the same number of states. 
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D 2 



(b) transition diagram of D3 



Fig. 2: transition diagrams 
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4 Upper bound for overlap-free words 

In this section, we prove an upper bound on the length of the shortest overlap- 
free word accepted by a DFAD over a binary alphabet. 

Let L = L(D) and let R be the set of overlap-free words over E%- Carpi 
[3] defined a certain operation \P on binary languages, and proved that ^(R) is 
regular. We prove that ^(L) is also regular, and hence ^(L) n ^(R) is regular. 
The next step is to apply Proposition Q] to get an upper bound on the length of 
the shortest word in <P(L) C\ { l r (R). This bound then gives us an upper bound on 
the length of the shortest overlap-free word in L. 

Let H = {e, 0, 1, 00, 11}. Carpi defines maps 

*,,* r :.E26->.ff 

such that for every pair h,h' G H, one has 

h = $i(a),h' = $ r {a) 

for exactly one letter a G £25 • 

For every word w G -£251 define < P{w) G E 2 inductively by 

*(c) = e,*(ow)=*i(o)M(*(«;))* r (o) {w e U; 5 ,a € S 25 ). (23) 

Expanding (|2"3"|) for w = a$a\ ■ ■ ■ a„_i, we get 

*j(ao)A*(*i(oi)) • • • M n_1 (^(an-i))A n " 1 (^r(a„_i)) • • • f*(*r(<*i))*r(ao). (24) 

For i C ^ define !?"(£) = IJxei^ 5-1 ^)' Based on the decomposition of 
Restivo and Salemi 9 for finite overlap-free words, the language &(x) is always 
nonempty for an overlap-free word x G T, 2 - The next theorem is due to Carpi 
®- 

Theorem 12. <P(R) is regular. 

Carpi constructed a DFA A with less than 400 states that accepts ^{R). We 
prove that \P preserves regular languages. 

Theorem 13. Let D = (Q,S2 > S,qo,F) be a DFA with N states, and let L = 
L(D). Then ^(L) is regular and is accepted by a DFA with at most N 4N states. 

Proof. Let 1 : Q — > Q denote the identity function, and define 770 , t]i '■ Q — > Q as 
follows 

Vi(q) = S(q,i) fori = 0,1. (25) 

For functions Co, Ci '■ Q — ► Q, and a word x = b^b\ ■ ■ ■ b n —i G ^2 1 define f x — 
Cbn-i ° ■ • • ° Cfci ° C&o ■ Therefore we have Cy ° Cx = Cxy Also by convention £ e = 1. 
So for example x G £(.D) if and only if n x (q ) G F. 
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We create DFA D' = (Q', S 2 5,S', q' Q ,F') where 

Q' = {[«, A, Co, Ci] : K , A, Co, Ci : Q ->■ Q}, 
<5'0,A,Co,Ci],a) = [C*j(a) OK 'AoC s . r(Q ),Ci °Co,Co °Ci] • 
Also let 

<?o = [hhVOiVl]: 

F / = {[K,A,Co,Ci]:Ao K ( go )gF}. (26) 

We can see that |Q'| = iV 4 ^. We claim that D' accepts *P(L). Indeed, on 
input w, the DFAD' simulates the behavior of D on <P(w). 
Let w = aoai ■ ■ ■ a n ~i E ^25, an d define 

<2>iO) = $i{a ao )n{$i{a x )) ■ ■ ■ /u n-1 (<Pj(o n _i)), 

<£ 2 (V) = A t"^ 1 (<? r (a„_i)) • • • ^(<P r (oi))* r (o ). 
Using (|24|. we can write 

$(w) = $i(w)$2(w). 

We prove by induction on n that 

^'(«0) w ) = [^iM, 7 ?^™): V*(o), V^(i)] ■ ( 27 ) 

For n = 0, we have <P(iy) = ^i(ty) = ^(w) = e. So 

^'(9o: £ ) = 9o = [hhrn^Vi] = [»7*iM)»7#2(u>))V(0))V(i)]- 

So we can assume (|27|l holds for some n > 0. Now suppose w — aodi ■ • • a n 
and write 

<5'(g ,a ai ■ ■ -a n ) 
= S'(8'(q' , a ai ■ ■ • a„_i), a„) 

= 5' ([%> 1 (™[0..ri-l]),^ 2 (u>[0..n-l]),?7 M "(0), 7 7M' l (l)] > a «) 

= [V((*l(i>»)) O7 7*i(«)[0..n-l]), 7 7* 2 ( l «[0..n-r]) o7 ? AI ™(0 r (a„)), ? 7 M "(l) ° V (0) , Vfi™ (0) or ?M"(l)] 

= [^i(w)>^# 2 (w)!^i"+ 1 (0)> 7 V ,+1 (l)] ' ( 28 ) 

and equality (|2"51) holds because 

*i(to[0..n-l])M n (^(on))=*i(«'). 

M"(^(a„))^ 2 (w[0..n - 1]) = <P 2 (», 

f* n (0)/i n (l) = /i™(01) = A« n (A*(0)) = M" +1 (0), and similarly 
M "(lK(0)=/x" +1 (l). 
Finally, using (l26l) . we have 

w E L(£>') 4=^ 6'(tf ,w) = [??#!(«,), ^(w), Co, Ci] E F' 
<*=*> »?*!(«)) °V<P 2 (w)(qa) E F 
«=* <?(w) = #i(io)<P 2 (to) g L(Z>). 

D 
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Theorem 14. Let D = (Q,S2,S,qo,F) be a DFA with N states. If D accepts 
at least one overlap-free word, then the length of the shortest overlap-free word 
accepted is 2 ( N '. 

Proof. Let L = L(D). Using Theorem[H there exists a DFA D' with N AN states 
that accepts the language ^{L). 

Since ^(R) is regular and is accepted by a DFA with at most 400 states, we 
see that 

K = #-(1) n &(R) 

is regular and is accepted by a DFA with O (N iN ) states. 

Since L accepts an overlap-free word, the language K is nonempty. Using 
Proposition [TJ we see that K contains a word w of length O (iV 47V ). 

Therefore <P(w) is an overlap-free word in L. By induction, one can easily 
prove that |#(w)| = O (2H). Hence we have |#(w)| = 2 oi - NAN \ □ 
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